Open Government Data and Beer Analytics

🏛 + 🍺

Open Data Science Conference | East

Jasmine Dumas | @jasdumas | jasdumas.github.io

Friday, May 5th, 2017

Hi Boston!

Why should you care open government data?

To foster and improve data literacy & statistical comprehension within our community, Scientists and Engineers need to advocate for data that is provided in an consistent and accessible method suitable for analysis.

What this presentation is about (and not about)

2017 has been an interesting time to be a Data Scientist…

What even is, beer analytics?

Discovering analysis-ready beer datasets can be difficult

How to generally search for analysis-ready datasets

The U.S. Government Open Data Portal

The “clearinghouse” for open U.S. government data is located at data.gov. It also contains tools & resources to conduct research, develop web and mobile applications, and design data visualizations.

Examples of analysis-ready datasets

Examples of datasets that are not analysis-ready

Here is the point of all of this…

Just because it’s open doesn’t mean it’s accessible!

Necessity breeds innovation

I developed a R package for beer statistics, called ttbbeer

How I developed the ttbbeer v1.0.0 package

##      Month Year Malt_and_malt_products Corn_and._corn_products
## 1  January 2015              316756669                60186876
## 2 February 2015              294630835                49718568
## 3    March 2015              320802302                53925263
##   Rice_and_rice_products Barley_and_barley_products
## 1               52438465                    8492680
## 2               48975137                    9355462
## 3               48710754                   22213428
##   Wheat_and_wheat_products Sugar_and_syrups Hops_dry Hops_extracts
## 1                  1392183         72069615  1916516        298514
## 2                  2145359         74431855  1775144        298894
## 3                  3761816         82679762  3750830        321462
##      Other
## 1 11163548
## 2 12017223
## 3 13892190

How I developed the ttbbeer v1.1.0 package

Insights from this data

Advocating for open data

https://www.data.gov/issues/request-id/32022/

https://www.data.gov/issues/request-id/32022/

Wrapping things up

Questions & Discussion!

ending slide